Insights into the Loblolly Pine Genome: Characterization of BAC and Fosmid Sequences
Authors
Abstract
Despite the prevalence and importance of conifers, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation efforts will be undertaken to characterize the reference sequences. Accurate annotation of these ancient genomes would be aided by a comprehensive repeat library; however, few studies have generated enough sequence to fully evaluate and catalog their non-genic content. In this paper, two sets of loblolly pine genomic sequence, 103 previously assembled BACs and 90,954 newly sequenced and assembled fosmid scaffolds, were analyzed. Together, this sequence represents 280 Mbp (roughly 1% of the loblolly pine genome) and one of the most comprehensive studies of repetitive elements and genes in a gymnosperm species. A combination of homology and de novo methodologies was applied to identify both conserved and novel repeats. Similarity analysis estimated a repetitive content of 27% that included both full and partial elements. When combined with the de novo investigation, the estimate increased to almost 86%. Over 60% of the repetitive sequence consists of full or partial LTR (long terminal repeat) retrotransposons. Through de novo approaches, 6,270 novel, full-length transposable element families and 9,415 sub-families were identified. Among those 6,270 families, 82% were annotated as single-copy. Several of the novel, high-copy families are described here, with the largest, PtPiedmont, comprising 133 full-length copies. In addition to repeats, analysis of the coding region identified 23 full-length eukaryotic orthologous proteins (KOGs) and another 29 novel or orthologous genes. These discoveries, along with other genomic resources, will be used to annotate conifer genomes and address long-standing questions about gymnosperm evolution.
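The proportions quoted in the abstract can be checked with simple arithmetic. The sketch below is only an illustration: it assumes a genome size of about 22 Gb (the figure given in the title of the assembly paper listed among the related articles, not in the abstract itself), and the function names are hypothetical.

```python
# Back-of-the-envelope check of figures quoted in the abstract.
SAMPLED_BP = 280e6        # 280 Mbp of BAC and fosmid sequence (abstract)
GENOME_BP = 22e9          # ASSUMPTION: ~22 Gb genome, from the related assembly paper's title
NOVEL_FAMILIES = 6270     # novel full-length TE families (abstract)
SINGLE_COPY_SHARE = 0.82  # share annotated as single-copy (abstract)


def genome_fraction(sampled_bp: float, genome_bp: float) -> float:
    """Fraction of the genome covered by the sampled sequence."""
    return sampled_bp / genome_bp


def approx_count(total: int, share: float) -> int:
    """Approximate count implied by a rounded percentage."""
    return round(total * share)


fraction = genome_fraction(SAMPLED_BP, GENOME_BP)
single_copy = approx_count(NOVEL_FAMILIES, SINGLE_COPY_SHARE)
multi_copy = NOVEL_FAMILIES - single_copy

print(f"Sampled fraction of genome: {fraction:.2%}")
print(f"Approx. single-copy families: {single_copy}")
print(f"Approx. multi-copy families: {multi_copy}")
```

Under that assumed genome size, the sampled fraction is consistent with the "roughly 1%" stated in the abstract. The family counts are approximate because the 82% figure is itself rounded.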
Related articles
Sequencing and Assembly of the 22-Gb Loblolly Pine Genome
Conifers are the predominant gymnosperm. The size and complexity of their genomes has presented formidable technical challenges for whole-genome shotgun sequencing and assembly. We employed novel strategies that allowed us to determine the loblolly pine (Pinus taeda) reference genome sequence, the largest genome assembled to date. Most of the sequence data were derived from whole-genome shotgun...
Adventures in the Enormous: A 1.8 Million Clone BAC Library for the 21.7 Gb Genome of Loblolly Pine
Loblolly pine (LP; Pinus taeda L.) is the most economically important tree in the U.S. and a cornerstone species in southeastern forests. However, genomics research on LP and other conifers has lagged behind studies on flowering plants due, in part, to the large size of conifer genomes. As a means to accelerate conifer genome research, we constructed a BAC library for the LP genotype 7-56. The ...
Determining the best form factor formula for Loblolly Pine (Pinus taeda L.) plantations at the age of 18, in Guilan, northern Iran
In order to determine the best form factor formula for Loblolly Pine (Pinus taeda L.) plantations in Talesh (Western Guilan province-Iran), a number of 110 trees were selected based on their distribution in diameter classes, from 12 to 34 cm (in a two- cm diameter interval). First, several quantitative factors including diameter at breast height, diameter at 0.65 m of height, and diameter at st...
Genetic Analysis of a Disease Resistance Gene from Loblolly Pine
Rapid advances in molecular genetics provide great opportunities for studies of host defense mechanisms. Examination of plant responses to disease at the cellular and molecular level permits both discovery of changes in gene expression in the tissues attacked by pathogens, and identification of genetic components involved in the interaction between host and pathogens. Expression of specific pro...
Whole-genome characterization of embryonic stage inbreeding depression in a selfed loblolly pine family.
Inbreeding depression is important in the evolution of plant populations and mating systems. Previous studies have suggested that early-acting inbreeding depression in plants is primarily due to lethal alleles and possibly epistatic interactions. Recent advances in molecular markers now make genetic mapping a powerful tool to study the genetic architecture of inbreeding depression. We describe ...
Journal title:
Volume 8, Issue
Pages -
Publication date: 2013